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101 Streamin g XML: XPath queries on streamin g data 

Feng Peng, Sudarshan S. Chawathe 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data SIGMOD '03 

Publisher: ACM Press 

Additional Information: full citation , abstract, references , citin gs, index 

terms 




Full text available:^ pdf (433.73 KB ) 



We present the design and implementation of the XSQ system for querying streaming 
XML data using XPath 1.0. Using a clean design based on a hierarchical arrangement of 
pushdown transducers augmented with buffers, XSQ supports features such as multiple 
predicates, closures, and aggregation. XSQ not only provides high throughput, but is also 
memory efficient: It buffers only data that must be buffered by any streaming XPath 
processor. We also present an empirical study of the performance character ... 

102 Online analytic processi ng ( OLAP): Spreadsheets in RDBMS for OLAP 

Andrew Witkowski, Srikanth Bellamkonda, Tolga Bozkaya, Gregory Dorman, Nathan Folkert, 
Abhinav Gupta, Lei Shen, Sankar Subramanian 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data SIGMOD '03 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 

terms 



H 




Full text available:^ pdf d 82.22 KB ) 



One of the critical deficiencies of SQL is lack of support for n-dimensional array-based 
computations which are frequent in OLAP environments. Relational OLAP (ROLAP) 
applications have to emulate them using joins, recently introduced SQL Window Functions 
[18] and complex and inefficient CASE expressions. The designated place in SQL for 
specifying calculations is the SELECT clause, which is extremely limiting and forces the 
user to generate queries using nested views, subqueries and complex joins ... 

103 XML indexin g and compression: Efficient processin g of j oins on set-valued attributes Q 

Nikos Mamoulis 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data SIGMOD '03 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 
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Full text available: |B pdf( 678.13 KB ) 



http://portal.acm.org/results.cfm? 3/16/2007 



Results (page 6): query optimization bitmap Page 2 of 7 



Object-oriented and object-relational DBMS support set valued attributes, which are a 
natural and concise way to model complex information. However, there has been limited 
research to-date on the evaluation of query operators that apply on sets. In this paper we 
study the join of two relations on their set-valued attributes. Various join types are 
considered, namely the set containment, set equality, and set overlap joins. We show that 
the inverted file, a powerful index for selection queries, c ... 

104 Statistics: Extended wavelets for multiple measures Q 
^ Antonios Deligiannakis, Nick Roussopoulos 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 

Management of data SIGMOD '03 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 



Full text available: TO Pdf(322.18 KB) 

^ terms 

While work in recent years has demonstrated that wavelets can be efficiently used to 
compress large quantities of data and provide fast and fairly accurate answers to queries, 
little emphasis has been placed on using wavelets in approximating datasets containing 
multiple measures. Existing decomposition approaches will either operate on each 
measure individually, or treat all measures as a vector of values and process them 
simultaneously. We show in this paper that the resulting individual 

105 Indus trial track session 2: server techn ology: Multi-dim ensional clusterin g : a new Q 

^ data la yout scheme in DB2 

^ Sriram Padmanabhan, Bishwaranjan Bhattacharjee, Tim Malkemus, Leslie Cranston, Matthew 
Huras 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data SIGMOD '03 

Publisher: ACM Press 

Full text available: ^| pdf(168.35 KB) Additional Information: full citation , abstract , references , index terms 

We describe the design and implementation of a new data layout scheme, called multi- 
dimensional clustering, in DB2 Universal Database Version 8. Many applications, e.g., 
OLAP and data warehousing, process a table or tables in a database using a multi- 
dimensional access paradigm. Currently, most database systems can only support 
organization of a table using a primary clustering index. Secondary indexes are created to 
access the tables when the primary key index is not applicable. Unfortunately, ... 

106 Discoverin g all most s pecific sentences Q 

vAv Dimitrios Gunopulos, Roni Khardon, Heikki Mannila, Sanjeev Saluja, Hannu Toivonen, Ram 
^ Sewak Sharma 

June 2003 ACM Transactions on Database Systems (TODS), volume 28 issue 2 
Publisher: ACM Press 

Additional Information: full citation, abstract , references , citings, index 



Full text available: TLI pdf(283.09 KB) 

terms 

Data mining can be viewed, in many instances, as the task of computing a representation 
of a theory of a model or a database, in particular by finding a set of maximally specific 
sentences satisfying some property. We prove some hardness results that rule out simple 
approaches to solving the problem.The a priori algorithm is an algorithm that has been 
successfully applied to many instances of the problem. We analyze this algorithm, and 
prove that is optimal when the maximally specific sen ... 

Keywords: Data mining, association rules, learning with membership queries, maximal 
frequent sets, minimal keys 
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Lipyeow Lim, Min Wang, Sriram Padmanabhan, Jeffrey Scott Vitter, Ramesh Agarwal Q 
\£7 May 2003 Proceedings of the 12th international conference on World Wide Web 

WWW '03 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings, index 



Full text available: TO pdf (233.78 KB ) 

terms 

Recent work on incremental crawling has enabled the indexed document collection of a 
search engine to be more synchronized with the changing World Wide Web. However, this 
synchronized collection is not immediately searchable, because the keyword index is 
rebuilt from scratch less frequently than the collection can be refreshed. An inverted index 
is usually used to index documents crawled from the web. Complete index rebuild at high 
frequency is expensive. Previous work on incremental inverted in ... 

Keywords: inverted files, update processing 



108 Astrolabe: A robust and scalable technology for distributed system monitoring , Q 
^ management and data mining 

^ Robbert Van Renesse, Kenneth P. Birman, Werner Vogels 

May 2003 ACM Transactions on Computer Systems (TOCS), volume 21 issue 2 

Publisher: ACM Press 

Full text available - Wi pdf(341 62 KB) Adc, itional Information: full citation, abstract , references , citings, index 

■ [Aj ■ terms 

Scalable management and self-organizational capabilities are emerging as central 
requirements for a generation of large-scale, highly dynamic, distributed applications. We 
have developed an entirely new distributed information management system called 
Astrolabe. Astrolabe collects large-scale system state, permitting rapid updates and 
providing on-the-fly attribute aggregation. This latter capability permits an application to 
locate a resource, and also offers a scalable way to track sys ... 

Keywords: Aggregation, epidemic protocols, failure detection, gossip, membership, 
publish-subscribe, scalability 

109 Data pag e layouts for relational databases on deep memory hierarchies Q 

Anastassia Ailamaki, David J. DeWitt, Mark D. Hill 

November 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 11 Issue 3 

Publisher: Springer-Verlag New York, Inc. 

Full text available: ^| pdf f 593.86 KB) Additional Information: full citation , abstract , index terms 

Relational database systems have traditionally optimized for I/O performance and 
organized records sequentially on disk pages using the N-ary Storage Model (NSM) 
(a.k.a., slotted pages). Recent research, however, indicates that cache utilization and 
performance is becoming increasingly important on modern platforms. In this paper, we 
first demonstrate that in-page data placement is the key to high cache performance and 
that NSM exhibits low cache utilization on modern platforms. Next, we ... 

Keywords: Cache-conscious database systems, Disk page layout, Relational data 
placement 



110 Strate g ies for processing ad hoc queries on lar g e data warehouses 
/jfcv Kurt Stockinger, Kesheng Wu, Arie Shoshani 
' November 2002 Proceedings of the 5th ACM international workshop on Data 
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Warehousing and OLAP DOLAP '02 

Publisher: ACM Press 

Full text available' pdf (245.31 KB ) Additional Information: full citation , abstract , references, citings, index 

terms 

As data warehousing applications grow in size, existing data organizations and access 
strategies, such as relational tables and B-tree indexes, are becoming increasingly 
ineffective. The two primary reasons for this are that these datasets involve many 
attributes and the queries on the data usually involve conditions on small subsets of the 
attributes. Two strategies are known to address these difficulties well, namely vertical 
partitioning and bitmap indexes. In this paper, we summarize our exp ... 

111 Web services and performance evaluation: Indexing web access-lo g s for pattern Q 
^ queries 

^ Alexandras Nanopoulos, Yannis Manolopoulos, Maciej Zakrzewicz, Tadeusz Morzy 

November 2002 Proceedings of the 4th international workshop on Web information 

and data management WIDM '02 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citing s, index 



Full text available: IB pdf M87.24 KB ) 

terms 

In this paper, we develop a new indexing method for large web access-logs. We are 
concerned with pattern queries, which advocate the search for access sequences that . 
contain certain query patterns. This kind of queries find applications in processing web-log 
mining results (e.g., finding typical/atypical access-sequences). The proposed method 
focuses on scalability to web-logs' sizes. For this reason, we examine the gains due to 
signature-trees, which can further improve the scalability to very ... 

112 Addin g subqueries to MySQL, what does it take to have a decision-su p port engine? 

vjgk Antonio Badia, Matt Chanda, Bin Cao 

^ November 2002 Proceedings of the 5th ACM international workshop on Data 

Warehousing and OLAP DOLAP '02 

Publisher: ACM Press 

Full text available: 1S| pdf(2Q0.24 KB ) Additional Information: full citation , abstract , references , index terms 



MySQL is an OLTP system without the ability to handle SQL subqueries. As part of our 
project in query optimization, we add subquery processing to MySQL through rewriting. 
First, we incorporate the ability to handle subqueries in the FROM clause; then rewrite 
queries with subqueries in the WHERE clause by moving the subquery to the FROM clause 
and introducing suitable conditions to link the subquery and the main query tables. The 
result is a complete and correct unnesting procedure for SQL querie ... 

Keywords: SLQ, query optimization, unnesting 



113 Efficient similarity search for market basket data 
Alexandros Nanopoulos, Yannis Manolopoulos 

October 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 11 Issue 2 

Publisher: Springer-Verlag New York, Inc. 

Full text available:^) pdf(21 2.88 KB) Additional Information: full citation , abstract , citings , index terms 

Several organizations have developed very large market basket databases for the 
maintenance of customer transactions. New applications, e.g., Web recommendation 
systems, present the requirement for processing similarity queries in market basket 
databases. In this paper, we propose a novel scheme for similarity search queries in 
basket data. We develop a new representation method, which, in contrast to existing 
approaches, is proven to provide correct results. New algorithms are proposed for the ... 
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114 Ex ploiting common subqueries for complex q uery o ptimization Q 

Yingying Tao, Qiang Zhu, Calisto Zuzarte 

September 2002 Proceedings of the 2002 conference of the Centre for Advanced 

Studies on Collaborative research CASCON '02 

Publisher: IBM Press 

Full text available: ^ pdf(300.77 KB) Additional Information: full citation , abstract , references , index terms 

As database technology is applied to more and more application areas, user queries on a 
database become more and more complex. Existing query optimization techniques were 
not developed for dealing with complex queries and may suffer from some serious 
problem such as intolerably long optimization time and poor optimizing results. To tackle 
this challenge, we introduce a new technique to improve the quality of complex query 
optimization in this paper. The key idea is to exploit the common subqueri ... 

115 Frequent patterns I: DualMiner: a dual-prunin g alg orithm for itemsets with constraints |jj 
Cristian Bucila, Johannes Gehrke, Daniel Kifer, Walker White 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining KDD '02 

Publisher: ACM Press 

Full text available - 155 pdf( 1 22 MB) Additional Information: full citation , abstract , references , citing s, index 

k- 1 terms 

Constraint-based mining of itemsets for questions such as "find all frequent itemsets 
where the total price is at least $50" has received much attention recently. Two classes of 
constraints, monotone and antimonotone, have been identified as very useful. There are 
algorithms that efficiently take advantage of either one of these two classes, but no 
previous algorithms can efficiently handle both types of constraints simultaneously. In this 
paper, we present the first algorithm (called Dua ... 

116 Research sessions: implementation techniques: Fractal prefetchin g B—Trees: Q 
o ptimizin g both cache and disk performance 

Shimin Chen, Phillip B. Gibbons, Todd C. Mowry, Gary Valentin 
June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data SIGMOD '02 

Publisher: ACM Press 

Full text available- f53 pdfd 49 MB) Additional Information: full citation, abstract, references, citings, index 

■ [Aj^— ^ >- terms 

B + -Trees have been traditionally optimized for I/O performance with disk pages as tree 
nodes. Recently, researchers have proposed new types of B + ~Trees optimized for CPU 
cache performance in main memory environments, where the tree node sizes are one or a 
few cache lines. Unfortunately, due primarily to this large discrepancy in optimal node 
sizes, existing disk-optimized B + -Trees suffer from poor cache performance while cache- 
optimized B + -Trees exhibi ... 




117 Research sessions: continuous queries and streams: Continuously adaptive 
^ continuous queries over streams 

^ Samuel Madden, Mehul Shah, Joseph. M. Hellerstein, Vijayshankar Raman 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 

Management of data SIGMOD '02 
Publisher: ACM Press 

Full text available- I?) pdf(1 59 MB) Additional Information: full citation , abstract , references , citings, index 
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We present a continuously adaptive, continuous query (CACQ) implementation based on 
the eddy query processing framework. We show that our design provides significant 
performance benefits over existing approaches to evaluating continuous queries, not only 
because of its adaptivity, but also because of the aggressive cross-query sharing of work 
and space that it enables. By breaking the abstraction of shared relational algebra 
expressions, our Telegraph CACQ implementation is able to share physica ... 

118 Research sessions: quer y processin g II: Efficient k-NN search on verticall y Q 
^ decomposed data 

^ Arjen P. de Vries, Nikos Mamoulis, Niels Nes, Martin Kersten 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 

Management of data SIGMOD '02 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citing s, index 



Full text available: TO pdf(1.26MB ) 

terms 

Applications like multimedia retrieval require efficient support for similarity search on 
large data collections. Yet, nearest neighbor search is a difficult problem in high 
dimensional spaces, rendering efficient applications hard to realize: index structures 
degrade rapidly with increasing dimensionality, while sequential search is not an 
attractive solution for repositories with millions of objects. This paper approaches the 
problem from a different angle. A solution is sought in an unconvent ... 

119 Research sessions: q uery processing I: Partial res ults for online query processin g Q 
Vijayshankar Raman, Joseph M. Hellerstein 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data SIGMOD '02 

Publisher: ACM Press 

Full text available* IS Ddfd 45 MB) Additional Information: full citation , abstract , references , citing s, index 
u . terms 

Traditional query processors generate full, accurate query results, either in batch or in 
pipelined fashion. We argue that this strict model is too rigid for exploratory queries over 
diverse and distributed data sources, such as sources on the Internet. Instead, we 
propose a looser model of querying in which a user submits a broad initial query outline, 
and the system continually generates partial result tuples that may contain values for 
only some of the output fields. The user can watch ... 






120 Research sessions: implementation techniques: Implementin g database operations 

usin g SIMP instructions 
Jingren Zhou, Kenneth A. Ross 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data SIGMOD '02 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citing s, index 



Full text available: 153 pdf(1.39 MB) 

" terms 

Modern CPUs have instructions that allow basic operations to be performed on several 
data elements in parallel. These instructions are called SIMD instructions, since they apply 
a single instruction to multiple data elements. SIMD technology was initially built into 
commodity processors in order to accelerate the performance of multimedia applications. 
SIMD instructions provide new opportunities for database engine design and 
implementation. We study various kinds of operations in a database con ... 
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Displaying records #11 through 20 out of 364 
Result # 11 Relevance: OOO '* 

Process for Representing Screen-Sized Graphics as an Icon 

1994-03-01 IPCOM000111726D English 

The invention is a process for selectively converting either a vector representation of a 
graphic or, alternately, a bit-map representation of a screen-sized graphic into an icon. 



Relevance: OOOlVC* 
of Using Compressed Bit-Map for Memory Utilization 

IPCOM000120439D English 
ie invented compressed bit map as a better way of keeping track of the b 



Result # 13 Relevance: 

Improved Graphic Image Bitmap Structure for Personal Computers 

1994-07-01 IPCOM000113081D English 

Described is an architectural implementation designed to improve the graphic image bil 
and controlling apparatus and to enable horizontal, vertical, and quadrangle area acces 
Personal Computers (PCs). The implementation relates to the graphic ... 



Result # 14 Relevance: OOOftO 

Method to Dynamically Allocate Bitmaps on a Per Page Basis for Perfc 
Improvement in the Adobe PDF Rip Process 

2006-03-08 IPCOM000134525D English 

Disclosed is a method(s) to force the Adobe* Configurable PostScript** Interpreter (i.e 
render into a new raster bitmap for each output bitmap it produces when interpreting e 
(Portable Document Format) or PostScript language programs. The reason for ... 



Result # 15 Relevance: OOO^'O 
Inter-Program Bitmap Data Distribution 

1989-11-01 IPCOM000036909D English 

This article describes a method for interchanging bitmaps in a mul tasking environment 
independent applications which requires minimal system overhead and minimizes progr 
to interchange bitmap data between applications. Inter-Program Bitmap Data ... 



Result # 16 Relevance: OOOL U 
Extracting Data From Bitmap in Display 

1990-02-01 IPCOM000099662D English 

In graphics displays, data to be operated on by a graphics engine may be stored in a p.' 
bitmap. This disclosure is concerned with extracting data from a single plane of a pixel 
another pixel bitmap. The technique described enables only selected bits ... 



Result # 17 Relevance: OO 

Recursive Dilation for Visual ICON Selection 

1994-04-01 IPCOM000111985D English 

A method is presented for an efficient process for providing a technique for CUA's defin 
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Result # 18 Relevance: OO . ~ • 
Hand-Held CRT Scanner 

1990-06-01 IPCOM000100907D 



English 



A solid-state camera and a motor-driven X-Y table are used with the distortion measure 
shown in Fig. 1. Test patterns displayed on the CRT are digitized with this equipment si 
map of the image is created for data manipulation. Seven minutes is required ... 



Result # 19 Relevance: OO V 

Time-Lapse Controlled Bitmap Display of Complex Graphical Data 

1994-06-01 IPCOM000112652D English 

Described is a method of displaying a complex graphic image on a Personal Computer ( 
windows-based application. A bitmap image is built in store and called to display it as n 
redraws avoiding repeated creations from scratch. While an application is ... 



Result # 20 Relevance: OOV-V- 



AUTOMATED REGRESSION TESTING USING A VIRTUAL PRINTER 



Disclosed is a method for fully automating the traditionally manual process of regressio 
printer management software. The following scenarios are addressed, which previously 
manual intervention: hardware or physical media failures; simultaneous disablement ... 
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Result # 1 Relevance: OOOOO 

DISTRIBUTED QUERY PROCESSING IN LOCAL - NETWORK DATABASES 

1984-12-31 IPCOM000128335D English 

to Distributed Databases 1.2 Introduction to Query Pr 

Purpose of the Thesis . , 1.4 Thesis Plan 

page 2 Distributed ... 
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